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Abstract — The paper gives an overview of multidimensional 
scaling (MDS) algorithm. Aside from brief introduction to 
MDS, its characteristics, its various types and its current 
applications has been discussed in wireless systems i.e. 
localization of wireless sensors in a network and determining 
the position of a mobile station without GPS. MDS is an 
algorithm by means of which information contained in a set of 
data is represented by a set of points in a low dimensional space. 
These points are arranged in such a way that distances between 
the points represents the relation between the data elements. 

Keywords — MDS algorithm, MDS types, MDS current 
applications, localization 

I.Introduction 

MDS is a set of data analyses technique. Information 
contained in a set of data is represented by a points in a 
multidimensional space such that the distance between the 
points tells about the relation between data sets i.e. their 
similarity or dissimilarity. Two data objects, if they are similar 
are represented by points that are close to each other in space 
and dissimilar objects are represented by points that are far 
apart. 

A. Example of MDS 

Suppose in a survey, it has been asked to compare the 
legal offences in term of their seriousness. The resulting data 
is represented in the form of a table 1. Each entry in the 
table tells the percentage by which respondents give their 
views of dissimilarity between offences. 

The given information can be represented easily by MDS 
analyses and represented the data in 2D space as shown in 
fig] 
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Figure 1: 2D scaling of data, of Table 1. 

In the fig.l, the distance between point 4 and 5 is 
small but 1 and 6 is large which shows that assault and 
receiving stolen gold are very unalike but burglary and libel 
are little alike according to respondents views. 

B. Literature survey 

MDS is a set of data analysis methods, which allow one 
to infer the dimensions of the perceptual space of subjects. 
The primary outcome of an MDS analysis is a spatial 
configuration, in which the objects are represented as points 
in such a way, that their distances correspond to the 
similarities of the objects. Their detailed study has been given 
in [1],[2],[3]. First well known MDS proposal was given by 
torgerson in 1952 that fits Euclidean distance model. It was 
based on metric analyses i.e. quantitative approach. In 1950, 
Attneave gives a Non-Euclidean distance model but it was 
not so successful. In 1962, Shepard gives psychometric 
approach. In 1964, Kruskal gives a non metric approach i.e. 
based on qualitative analyses. Till then new researches are 
going on and various softwares have also been introduced 
for MDS like INDSCAL, ALSCAL etc. MDS has been used 
in various applications. Main is used in wireless localization 
like sensor network localization and mobile station 
localization. 

A wireless sensor network [5] comprises of a large 
number of wireless sensor nodes and is randomly deployed. 
The location information does not only expand application 
scenarios, but also enhances performance of network 
protocols. In many applications, such as environmental 
monitoring, disaster detection, and battlefield monitoring, 
location information is vital. Deployments of wireless sensor 
net- works can be roughly categorized into manual settings, 
GPS-based approaches, and localization algorithms. It is not 
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practical to manually set sensor location, because it is too 
time-consuming and expensive. 

For the GPS -based localization method, the current/ 
existing GPS component is too big to fit into a tiny wireless 
sensor node. Another drawback is that GPS cannot be used 
in indoors. Locations of sensor nodes can be computed with 
the localization algorithm. A great deal of work over the past 
few years has focused on the sensor localization problem. A 
detailed survey of this area is provided in [6] . In the case 
when all pair- wise distances are known, the coordinates can 
be derived by using a classical method known as 
multidimensional scaling (MDS) [7], [8]. Shang et al. 
proposed the MDS-MAP localization algorithm using 
multidimensional scaling. MDS-MAP introduced in [9] can 
be mentioned as a well known example of this class where it 
computes the shortest paths between all pairs of nodes in 
order to approximate the missing entries of the distance 
matrix. Another algorithms that mainly consider the sensor 
localization as a non-convex optimization problem and 
directly estimate the coordinates of sensors is a relaxation to 
semi-definite programming (SDP) [10]. In [11] where they 
use matrix completion methods as a mean to reconstruct the 
distance matrix. Their analysis is based on a number of strong 
assumptions. First, they assume that even far away sensors 
have non-zero probability of detecting their distances. 
Second, the algorithm explicitly requires the knowledge of 
detection probabilities between all pairs. MDS MAP require 
weaker assumptions. It use shortest paths for the missing 
entries in the distance matrix and apply MDS to find the 
topology of the network. Here it is assumed that only 
neighbouring sensors have information about each other and 
that only connectivity information is known and the 
knowledge of detection probabilities plays no role in this 
algorithm. MDS-MAP still has the following drawbacks: 1) 
The time complexity is high. When there are a huge number 
of sensors, large bandwidth and computation are required to 
estimate locations. 2) In the case of a network area with non- 
convex shape, the computed coordinates are often erroneous. 
To resolve the drawbacks of MDS-MAP, a HMDS [12] is 
proposed which has low time complexity and which can 
operate in non-convex network environments. 

Mobile locations could support many applications, such 
as emergency services, roadside assistance, navigation, 
tracking, and so on [13]. With the popularization of wireless 
services, more and more people call for emergency services 
by wireless phone, and the system operators want to provide 
subscriber's location information which conforms to the FCC 
requirement. Location estimations can be done through many 
ways. The first is the Global Positioning System (GPS) [14], 
which provides accurate positioning but fails when satellite 
signals are blocked like indoor situations. The second is MS 
location based on RSS measurements [15]. This method is 
has to measure the power of the signal at the receiver and 
low accuracy is achieved in this way due to the complex 
propagation mechanisms. The third way uses the angle-of- 
arrival (AOA) [16] . However, this method requires a complex 
antenna array system and suffers from NLOS propagation. 
The fourth method is the TO A [17]. NLOS propagation is a 
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potential disadvantage of TOA method. The MDS has been 
proposed for wireless network based mobile location. Due 
to the MDS needs distance measurement data between the 
MS and BSs, the MDS technique has been proposed for 
mobile location using the TOA method [18]. It improves the 
performance of mobile location estimation. 

C. Algorithmic flow of MDS 

It is shown in fig. 2. Proximities are the data for MDS 
analyses. It can show similarity vs. A-similarity and can be 
symmetric vs. Asymmetric. MDS model can be metric/non 
metric with Euclidean distance or non Euclidean distance. 
Dimensions can be l,2or more depending on the parameters 
that are to be surveyed. Analyses can be individual, 
aggregated or weighted. 

Details are given ahead. Various softwares are used for 
MDS analyses that gives the final results of coordinates of 
data in the space. Most commonly used is ALSCAL 
developed by Forrest Young. 
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Figure 2: Flow for MDS analyses 
II. CHARACTERISTICS OF MDS 

There are four main characteristics that a researcher 
must consider while using MDS:- 

• The data : It is simply the data that is to be presented 
for MDS analyses. It can be in the form of matrix. Distinction 
between data can be made on 'way of data' and 'modes of 
data' . 

• Way of data : It simply represent the dimensionality 
matrix like 'One way data' It is like a single column matrix 
that can't be scaled i.e. can't apply MDS. 'Two way data' It 
is represented in matrix form and can be scaled by MDS. 

• Modes of data : It tells about the kind of matrix the 
input data has like 'One mode data' has rows and columns 
refer to the same set of objects. The matrix is square and 
symmetric. 'Two mode data' has rows and columns of matrix 
refers to different set of objects. The matrix is rectangular 
and asymmetric. 

The transformation technique : It rescales data in a matrix 
into an n -dimensional space. Two types :-monotonic and 
linear. 'Monotonic' preserves ordinal information of the data. 
It assumes rank order of entries in data matrix contain 
significant information. 'Linear' are expressible in simple 
mathematical form and are simpler. Data dissimilarities are 
direct estimate of distance between the points. 
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• The models The model represents data value as 

distance between the points in n- dimensional space. Main 
model used in MDS is 'Euclidean distance model' the data 
in the matrix is represented as 'distance like'. It can be 
unweighted and weighted. In 'Unweighted model', all 
dimensions are given equal importance. Distance between 
points i and j in dimension a is given by: 
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where x. a specifies the position of point i in 
dimension a. In 'Weighted model', the importance of each 
dimension is taken in form of weights. It gives the distance 
between the points i and j as: 
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where w ka is a 



diagonal matrix represented the 
importance of each dimension to each individual. 
• Scaling techniques: Here we categorize the data in 

such a way that the properties of original relations remain 
unchanged. The number of topologies has been proposed 
but the most long lived classification is that of Stevens (1946). 
He disguised data in nominal, ordinal, interval and ratio level 
of measurement that are shown in fig. 3. In MDS, mostly 
ordinal and ratio level are used. 

In 'Ordinal level' we categories the data on the basis 
of ranks Then plotting in n-dimensional space on this rank 
basis is done. It is further of two types: partial i.e. Some 
objects while ranking can come at same level, often results 
from missing data, and complete i.e. no missing data. 
In 'Ratio level' Numerical value of distance between objects 
between adjacent points is consider. 
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Figure 3: Various Scaling Techniques 



III.TYPES OF MDS 

MDS can be classified into various types :- 

• Classical MDS: - It takes only one input matrix 
giving dissimilarity between pair of items. It uses Euclidean 
distance model to model the dissimilarities. Two types :- 
Metric CMOS:- The data in the matrix represents quantitative 
similarities. Ratio and interval level of measurements are 
used. Transformation used is linear transformation i.e. 

I{S}= D+E where I{S} is linear transformation of 
similarities. D is Euclidean distance matrix. E is matrix of 
error. Data matrix is square and symmetric. 
Non metric CMOS: -. The data in matrix represents 
qualitative similarities. Ordinal level of measurement is used. 
Transformation is monotonic transformation, i.e. 

M{S} = D+E where m{S} is monotonic transformation 
of similarities. The model is Euclidean distance model. Data 
is two mode i.e. Matrix is rectangular and asymmetric. 

• Replicated MDS:- It permits the analyses of several 
matrices of similarity data simultaneously. It uses same 
Euclidean distance model as in CMDS but on several input 
data matrix not on a single matrix. It is also of two types 
metric and non-metric. 

• Weighted MDS: In RMDS the difference in 
responses of individuals is taken into account but in WMDS 
the cause of difference in response of several individuals is 
consider To take it into account weight w k is included that 
represents the importance of each dimension to each 
individual. So it is also called individual difference scaling 
(INDSCAL). 

It uses weighted Euclidean distance model. It is also of 
metric and non metric types. RMDS generates a single 
distance matrix D, while WMDS generates m unique distance 
matrices D , one for each data matrix S, . 

k' k 

IV.APPLICATIONS IN LOCALIZATION TECHNIQUES 

Applications of MDS are appearing at an increasing 
rate and at various disciplines such as 'Marketing ' for taking 
the preferences and perceptions of respondents and 'Data 
visualizations' tuning a metric space, in which points 
correspond to descriptors values, so that the metric space 
represents a perceptive distance between images. Here 
main discussion will be on :- 

• localization in a wireless sensor network using MDS 

• Finding the location of a mobile station in an area 
of a cellular network using MDS 

A. Localization in a wireless sensor network 

Determining the physical position of sensor from a 
large number of sensors randomly and densely deployed in 
a certain area is important to use the data collected by the 
sensor. Two approaches are proposed for localization of 
wireless sensor network: - 

a) Centralized approach:- In centralization approach, 
the inter-node ranging and connectivity data is migrated to a 
central base station and then the resultinglocations are 
migrated back to respective nodes. MDS-MAP is centralized 
algorithm used for network sensor localization that uses 
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multidimensional scaling. It uses classic metric MDS. 

Let there are n sensors in a network, with positions 
Xi, i = 1 . . . n, and let matrix X = [XI, X2, . . . ,Xn] T . X is 
nxm, where m is the dimensionality of X. Consider m to be 
an unknown. Let D = [dij ] be the nxn matrix of pair- wise 
distance measurements. The distance measurements dij must 
obey the triangular inequality: dij + dik >= djk for all (i, j, 
k). 

Classical metric MDS is derived from the 'Law of 
Cosines' , which states that given two sides of a triangle dij , 
dik, and the angle between them _jik, the third side can be 
computed using the formula: 

djk = tfs + <4 - Zdijdik cos & m 



Rewriting: 



1 



dijdik cos 9 jik = - (d^ + d ik - dj k ) 
Similarly, d. k = X k -X, d. k =X k -X. So rewriting we get 

(x, - x*) - (x k - x t ) = 1(4 + a? fc - «£) 

To find the positions X we have to choose some X Q from X 
to be the origin of a coordinate system, and construct a matrix 
B that is known as the matrix of scalar products and the 

values as follows: 

1 



*>ij = o(<% + 
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dij) 



Now, we get a new value of coordinates X' shifted according 

to origin X Where X.'=X.-X (1) 

Now B can also be written as: X'X' T =B 
We can solve for X by taking an eigen-decomposition of B 
into an orthonormal matrix of eigenvectors U and a diagonal 
matrix of matching eigenvalues V. 
B = X'X' T = UVU T 

X'= UV 1/2 We need to find X in 2-D space or 

3D-space. To do this, we throw away all but the two or three 
largest eigenvalues from V, leaving a 2x2 or 3x3 diagonal 
matrix, and throw away the matching eigenvectors U. Then 
X Q has the proper dimensionality. Now from (1) we can find 
Xi's. Hence we have determine the coordinates of sensors. 

This approach of MDS has some drawbacks as 
mentioned. A single pass of multidimensional scaling cannot 
operate on a large topology, MDS MAP also does not use 
anchor nodes very well as anchor density increases, 
calculated positions are very near to actual value but don't 
have exact values, This approach requires distance 
measurement for all pairs of nodes but some nodes are very 
far away from a particular sensor and distance can't be 
measured. 

To overcome these difficulties we use another 
method based on distributed approach. 

b) Distributed approach:- This approach is also 
called hierarchical MDS-based localization algorithm 
(HMDS). In this architecture, a sensor node may take the a) 
roles of cluster head, cluster member or gateway node. It 
makes an assumption that whole network is connected i.e. 
two node have at least one path. Also, each device supports 
RSSI (Received signal strength indicator measurement). 
HMDS has three execution phase which are as follows: 
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1) Clustering phase:- Here the whole network is divided 
into multiple clusters using various algorithms. 

2) Intra cluster localization phase:- Each cluster contains 
one cluster head, lots of gateways and many members. 
Cluster head collects distance measurements from all 
members of the cluster and compute distances of all pairs of 
devices using the shortest path algorithm such as the Dijkstra 
algorithm, so form a distance matrix of cluster members. 
Applying a MDS on a distance matrix computes the relative 
coordinates of each cluster member and forms a local map. 

3) Merge phase:- A gateway node participates in at least 
two neighbouring clusters, exchanges information between 
two clusters. Each gateway is assigned at least two 
coordinates from each cluster the gateway participated in. 
Different coordinates in the same gateway node implies that 
the coordinate systems in neighbouring clusters are different, 
and this information can be used to merge two coordinate 
systems into a unified coordinate system. 

These merged clusters repeatedly broadcast merge 
requests to neighbouring clusters and calibrate coordinates 
of the neighbouring clusters until the coordinates of all 
clusters are consistent and a global map is computed. Hence 
we get accurate positioning system. 
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Figure 4: sensor network deployment 

B. Finding the location of a mobile station in a cellular 
network using MDS 

Mobile locations estimation can support many 
applications, such as emergency services, roadside assistance, 
navigation, tracking etc. Various techniques can be used like 
GPS, RSS but these suffer from drawbacks like blockage of 
signal and complex propagation path. 

The MDS has been proposed for wireless network 
based mobile location. Due to the MDS needs distance 
measurement data between the MS and BSs, the MDS 
technique has been proposed for mobile location using the 
TO A (time of arrival) method. Using TO A method, the range 
can be modelled as : r.= d.+n. where r. is range measurement 
between MS and BS.. n. is range error that occurs due to 
multipath fading and propagation losses that includes range 
delay. To estimate mobile location, a square distance matrix 
is constructed as: 
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distance between BS and BS. h is matrix of range between 

i j t & 

BS and MS. From Dt we can obtain the scalar product matrix 
Bt that gives the distance of all BSs and MS from a given 
location that is consider as origin. But distance in matrix Dt 
is pair wise between either in BSs or BS and MS. Bt is 
obtained by double centring technique i.e. subtracting rows 
and columns means from each element. Then add grand mean 
to each element and multiplying by (-1/2). 

When employing the classical MDS, the centroid 
of the all coordinates is assumed at the origin, so the scalar 
matrix, Bt can be express as : 



3, = JET 






Xu 
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(2) 



matrix of BS with is 



<*** L- x ; 3V J denotes the coordinates of BSj. 
By eigenvalue decomposition Bt can be represented as : 

B T= UVU T 
Where V is diagonal matrix of eigenvalues of B . And U is a 
matrix having corresponding eigenvectors. 
From this we can evaluate the value of X as 

x= uv 1/2 

Now if we know the value of X, also all BSs are 
fixed and hence we know the value of coordinates of BSs. 
Thus, from (2), we can find the value of X MS i.e. coordinates 
of MS. 

Classical multidimensional scaling is sensitive to the 
range measurements error in the NLOS (non line of sight) 
scenario. 

Conclusion 

This paper gives an overview of multidimensional 
scaling algorithm and its current practical applications. It is 
an optimization algorithm that has very bright scope in future 
and will help in understanding and solving large complex 
processes which seems to be very difficult in today's scenario. 
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